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ABSTRACT 

We present a parameterized model of the intra-cluster medium that is suitable for 
jointly analysing pointed observations of the Sunyaev-Zel'dovich (SZ) effect and X-ray 
emission in galaxy clusters. The model is based on assumptions of hydrostatic equilib- 
rium, the Navarro, Frenk and White (NFW) model for the dark matter, and a softened 
power law profile for the gas entropy. We test this entropy-based model against high 
and low signal-to-noise mock observations of a relaxed and recently-merged cluster 
from ./V-body/hydrodynamic simulations, using Bayesian hyper-parameters to opti- 
mise the relative statistical weighting of the mock SZ and X-ray data. We find that 
it accurately reproduces both the global values of the cluster temperature, total mass 
and gas mass fraction (/ gas ), as well as the radial dependencies of these quantities 
outside of the core (r > 100 kpc). For reference we also provide a comparison with 
results from the single isothermal j3 model. We confirm previous results that the single 
isothermal /3 model can result in significant biases in derived cluster properties. 



Key words: 

methods: data analysis 



galaxies: clusters: intracluster medium. 



1 INTRODUCTION 

The Sunyaev-Zel'dovich effect is a distortion of the Cos- 
mic Microwave Background (CMB) spectrum by inverse 
Compton scattering off a hot population of electrons, such 
as in the intra-cluster medium (ICM) of a g alaxy cluster 
jSunvaev fc Zel'dovichlll970l ; lBirkinshawlll999l ). The SZ ef- 
fect is observable as an anisotropy in the CMB and has a 
surface brightness that is proportional to the line-of-sight 
integral of the gas pressure of the cluster, independent of 
redshift. Since the surface brightness is independent of red- 
shift, SZ observations provide a powerful tool to probe the 
properties of the largest virialised structures in the Universe. 
Over the past decade there has been considerable observa- 
tional data gather ed on the ICM properties of clusters via 
the SZ effect (e.g. iGrego et alj|200ll : lLaRoque et al.ll2006t ) 
and we are now entering an era of deep cluster surveys with 
specifically desig ned SZ telescopes, including the South Pole 



In order to interpret the data from SZ surveys we 
require well-calibrated scaling relations between the SZ 
signal and physical properties and a good physical un- 
derstanding of the intrinsic scatter therein. Pointed SZ 
observations with interferometer and bolometer arrays 
coupled with X-ray imaging and spectra allow us to 
probe the intra-cluster medium, while weak and strong 
gravitational lensing observations directly measure the 
projected gravitational potential and hence total mass 
of the cluster. Recently, there has been concerted ef- 
fort in developing physically motivated analytic models 
that account for the observed non- i sothermality in clus- 
ter gas [see e.g. Komatsu fc Seliakl 200ll; Ostriker et al.l 



_gai 



Telescope CS PT. iRuhl et al.ll2004f) the Sunyaev-Zel'dovich 



4) 



Array (SZA, iMuchovei et al.l 2007 h the Atacama Cosmol 



ogy Telescope (ACT, iKosowskvl 120031) a nd the Arcminute 
Microkelvin Imager (AMI. IZwart et al.l 120081 '). These sur- 
veys will make use of the redshift independence of the SZ 
surface brightness to map the distribution and evolution of 
large scale structure throughout the Universe. 
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120051 : lAtrio-Barandela et al.l l2008: Bul bul et al.ll2010l ( poly- 
tropic equatio n of state), iPo intccout eau et al.l 12004 : 

IVikhlinin et al.l 120061 : iMahdavi et al.ll2007l (component sep- 
aration and modified 8 models) , and Naeai et al.l 120071 : 
iMroczkowski et al.l 12003 : lArnaud et al.l I2OIO1 (explicit pres- 
sure parametrization)]. 

We present a model in this work that avoids using an 
explicit temperature and electron density parametrization 
that may lead to non-physical cluster properties on large 
scales. We instead choose to base our ICM model on a sim- 
ple parametrization of the cluster entropy, consistent with 
X-ray observations and cluster theory of spherical shock 
accretion and coolin g. We use the NFW parametrization 
IjNavarro et al.ll 19971 ) for the total mass distribution and as- 
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sume hydrostatic equilibrium. We outline our model with 
reference to pointed SZ observations with the Cosmic Back- 
ground Imager 2 experiment (CBI2) located at the Chaj- 
nantor observatory in the Atacama Desert, Chile. This was 
an interferometer operating at 26-36 GH z, with 10 x 1 GHz 
channels, and thirteen 1.4 m antennas IjPadin et al] 120021 . 
Taylor et al. in prep.). CBI2 has a 28 arcmin field of view and 
6arcmin resolution so that at moderate redshifts (z ~ 0.3) 
it can observe out to the largest radii of galaxy clusters. In 
future work we will present the application of the procedure 
outlined here to observations of a number of massive galaxy 
clusters with CBI2. 

We adopt a ACDM cosmology with fi M = 0.3, Q\ = 0.7 
and Ho — 70kms~ 1 Mpc~ 1 throughout this paper unless 
otherwise stated. 



2 MODELLING THE INTRA-CLUSTER 
MEDIUM 

2.1 Current Models of the intra-cluster medium 

In the past most of the joint analysis of SZ and X-ray 
data from observations of galaxy clusters made use of the 
isothermal /3 model prescription for m odelling the ICM 
|Cavaliere fc Fusco-Femiandll976l , ll978r >. This model is typ- 
ically used to fit regions of the cluster within r2soo, the ra- 
dius at which the mean enclosed density is 2500 times the 
universal critical density at the redshift of the cluster. At 
relatively small radii the isothermal /3 model is found to be 
accurate in reproducing the average obs ervable and physi- 
cal properties of many clusters (see e.g . iGrego et al] |2000| ; 
iReese et al.ll2002l : lLaRoaue et al.ll2006n . However, deep X- 
ray spectral obse rvations of nearby cl uster samples using 
Chandr a (see e.g. | Vikhlinin etUD feooe') and XMM-Newton 



(see e.g. lPratt et al.ll2007r i show that at larger radii the ICM 



temperature declines with radius. Hence for low-resolution 
pointed SZ observations that are sensitive to the outskirts 
of the cluster gas, there is a clear requirement for a more 
physically motivated non-isothermal model that is simple 
enough to be constrained by the data. 

There have been a number of recent approaches 
amongst SZ and X-ray observers to relax the assumption 
of isothermality in clusters and model the ICM in a phys- 
ical way. For Chandra obse rvations of nearby relaxed clus- 
ters, IVikhlinin et al] IJ2006T ) developed a modified /3 model 
of the electron density to fit the observed cores and steeper 
outer profiles of relaxed clusters. In that work the tem- 
perature is parameterized by a broken power law at large 
radii and d e clines in the central cooling region as given by 
I Allen et _al] (1200l|). The jaco software pipeline developed 
by iMahdavi et al] 1)20071 ) uses separate parametrization of 
the gas, dark matter and stellar components, assuming hy- 
drostatic equilibrium, to jointly fit to high significance X- 
ray spectra from Chandra and XMM-Newton, weak lensing 
shear from Canada- France-Hawaii telescope, optical data on 
the brightest cluster galaxy from the Hubble space tele- 
scope and the SZ effect from the Cosmic Background Im- 
ager. For sufficiently resolved SZ maps, the temperature and 
mass profiles can be rec onstructed by de-pro j ection tech- 
niques, for example see lAmeglio et al] IJ20071 . 120091) for a 
comparison of the method to simulations and iNord et al] 



(2009) for direct application to SZ and X-ray data. How- 
ever with many SZ experiments the data are generally not 
very well resolved, and so combined analysis with X-ray 
surface brightness data requires the application of a pa- 
rameterized model that reproduces the observed tempera- 
tures and masses from X -ray observations and simulations. 
iMroczkowski et al] (J2009J) successfull y use an NFW para m- 
eterized pressure profile developed bv lNagai et al] (120071 ) to 
fit to data from the Sunyaev-Zel'dovich Array (SZA), and 
then combine with X-ray surface brightness to reproduce 
temperature pro files out to large radi i. This model has also 
been adopted bv lArnaud et al] Q2010J) in order derive a uni- 
versal pressure profile from XMM-Ne wton observat i ons o f 
the REXCESS cluster sample. Recentlv lBulbul et al] (|2010h 
have developed an analytical cluster model based on assum- 
ing a polytropic relationship between the gas density and 
temperature, and parameterise the total mass using a gen- 
eralised NFW model. In order to account for cool-core be- 
haviour the resulting ra dial expressions are mo dified by the 
same core taper used bv lVikhlinin et al] (|2006l ). 

The CBI2 array has relatively low spatial resolution, 
and a large field-of-view, and hence a simple parameterized 
model is required that is accurate over the bulk of the volume 
of the cluster. We could choose any function of gas density 
and temperature as the basis for our gas model, e.g. pressure 
(oc Tn e ) or entropy (ex Tn^ ). For example iNagai et al] 
(2007) choose to use a four-parameter model for the cluster 
pressure profile. We choose to parameterize the ICM based 
on the entropy because there is evidence from both obser- 
vations and simulations that it can be modelled as a simple 
power-law over most of the cluster volume (see Section l2T2l 
below). We combine this with an NFW parametrization of 
the dark matter halo component. By modelling the gas in 
this way, the underlying physics of cluster gas in hydro- 
static equilibrium is encoded into the model from the out- 
set, and is simple enough to be constrained by X-ray surface 
brightness and low-resolution SZ data. This also avoids ad- 
hoc parametrization of the electron density and temperature 
and therefore removes the possibility of introducing unphys- 
ical solutions for the total mass. The following section de- 
scribes in detail the physical basis and parametrization of 
this model. 



2.2 An Entropy-based model 

2.2.1 The Dark Mass Halo 

We construct a parametric model that is physically consis- 
tent and is therefore a reasonable description of the ICM 
on the angular scales to which CBI2 is sensitive. A suitable 
parametrization of the dark matter content is th e hierarchi- 
cal clustering NFW profile (|Navarro et al]|l995f) . where the 
dark matter (DM) density as a function of radius is given 
by 



Pdm(»") = 



8 c pc 



(r/r s )(l +r/r s 



(1) 



where p cr it is the universal critical density for closure (at 
the redshift of the cluster), r s is the scale radius and S c 
is the characteristic density contrast. The DM virial radius 
roM is defined in this model as the radius of a sphere at 
which the mean interior DM density is equal to 200p C rit- 
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Note that the true virial radius of the cluster is slightly 
larger (~ 10 percent) since one must also take into account 
the contribution from the gas when calculating the total 
mass. It is related to the scale radius by 



fDM 
CDM 



(2) 



where cdm is known as the DM concentrati on parameter. 
This is related to the DM density contrast (jNavarro et al.l 
\wM) by 



5c 



200 



'•DM 



3 [ln(l + cdm) - c D m/(1 + cdm)] ' 



(3) 



The DM density becomes increasingly large at small radii, 
and is unbounded at the centre of the cluster. However the 
enclosed DM mass is calculated by integrating this density 
over volume and this converges at the origin. The enclosed 
DM mass is given by the following expression 



A/dm(< r) = 4-7r<5 c p cr i t r a 



hi 



m-G 



(4) 



Therefore the DM mass distribution in the cluster is defined 
by the two parameters cdm and tdm- 

2.2.2 The Entropy 

In order to derive the electron pressure, and hence the ther- 
mal SZ decrement, a physical quantity is required that de- 
scribes the gaseous properties in a similar way for all clus- 
ters. The cluster entropy is expected to behave in a self- 
similar way for reasonably massive clusters, differing pri- 
marily in the cluster cores due to the injection of non- 
gravitational ther mal energy. The elect r on pressure and en- 



tropy observable (jPonman et al.l 1 19991 : iLlovd-Davies et al.l 



tropy 
2000) 



5'e 



are given by 

= S c n2 
T c 



7-1 ' 



(5) 
(6) 



where P c , S c , n c and T e are the electron pressure, entropy, 
density and temperature (in keV units) respectively. The 
parameter 7 is the adiabatic ratio of heat capacities and 
for an ideal monatomic gas is equal to 5/3. The observable 
quantity Sc is related to the true thermodynamic entropy 
per gas particle Sth by the following equation 



S th = Aln(S c ) + B y 



(7) 



where A and B are both functions of the funda mental and 
ideal gas constants. At small radii (r < 0.1r9nn. IVoit et al.l 
|2005j) the entropy profiles of clusters are expected to have 
a low-entropy core region as a result of radiative cooling of 
the gas. However excess entropy may also exist in this re- 
gion due to additional heating processes from central AGN 
and star formation. At the opposite end of the radial scale 
the accretion of matter at the interface between the cluster 
gas and the external medium generates an expanding shock- 
front, with an ever-increasing virial mass and hence infall- 
velocity. It is therefore expected that the entropy profile in- 
creases as a function of radius towards the edge of the clus- 
ter. Simulations of non-radiative clusters predict that the 
entropy profile should tend to a power law, where S e (r) oc 
r 11 (e.g. lTozzi fc NormanllJOol ; lKay||20o4 IVoit et al.ll2005l ; 



iMitchell et al.1 12009'1. X-ray observations to date are consis- 
tent with this predicted behaviour (see e .g. Ponm an et al 



| 2003 |: )Piffaretti et al.ir2005l:lDonahue et al|2006l: | Pratt et al 
pOOalMorandi fc Ettorill2007l ; IZhang et alJhooalPratt et~ 
[201fj) , and a rec ent study of the high r esolution Chandra 
Data Archive by ICavaenolo et al] (|2009J) has provided ev- 
idence of this expected behaviour in a collection of 239 
clusters. Therefore, for the purposes of providing a physi- 
cally motivated gas model, we choose the beta-model-like 
parametrization for the entropy profile 



(8) 



Sc(r) = SeO 1 + 



where S c o is the central entropy value, r corc is a scale ra- 
dius, and a is a parameter describing the behaviour at radii 
much larger than r cor c. In terms of the central electron den- 
sity and temperature, S c o — T e o/n e Q . Given the findings of 
the simulation and observational work outlined above, one 
would expect the entropy profile to be dominated by non- 
gravitational processes at radii smaller than r- CO rc ~ 0.1r2oo, 
and at larger radii follow a power-law S c oc r 2a , where 
a ~ 0.55. 

We note that recent X-ray observations of the ICM 
using Suzaku give evidence for a deviation in the en- 
tropy behaviour fro m the r 1 ' 1 power-law to flatter profiles 
at large radii (e.g. iGeorge et al. _ 20091: Bautz et al] |2009j; 



iHoshino et al] |2010| ; iKawaharada et al] I2010T ). Interpreta- 
tions of these results include significant deviation from hy- 
drostatic equilibrium and the effects of low thermalisation 
between electrons and ions in the cluster outskirts, especially 
whe re the edge of the cluster gas meets low-density regions 
(see IKawaharada et al] l2010l . and references therein) . The 
entropy-based model presented here can easily be adapted 
to account for this observed behaviour at large radii by the 
introduction of additional parameters. For the purposes of 
this work we do not adapt the model for the results of the 
Suzaku observations, however in future work on CBI2 data 
we will consider the entropy behaviour at the cluster out- 
skirts. 



2.2.3 The Pressure 

If we assume that the gas particles of the intra-cluster 
medium are in local thermodynamic equilibrium, then the 
condition of hydrostatic equilibrium relates the total en- 
closed mass (M) to the electron density and partial pressure 
by 



(9) 



dP e GM^im p n e (r) 

dr r 2 

where /j, is the mean molecular mass per gas particle, which 
has a value of 0.6 for a fully ionised plasma with cosmic 
hydrogen and helium abundance ratios. From the expression 
given in Equation(6] and assuming the ideal gas law holds for 
the electron pressure (where P c — n B T B ), the electron density 
as a function of pressure and entropy can be expressed as 



n c = (P e /5c 



\3/5 



(10) 



The pressure as a function of radius can then be derived by 
substituting the above expression for n e into Equation|9]and 
rearranging to give 
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P*(r) = 



,2/5 



■Gfim p 



Mdr' 



-i 5/2 



Se^V 2 



(11) 



where the central pressure, P c o, is equal to the product of the 
central density and temperature. The integral in the above 
expression can be solved numerically so that the electron 
pressure is calculated as a function of radius. The total mass 
is calculated by summing the enclosed DM and baryonic 
masses (note that the fraction of mass contained in stellar 
material is considered insignificant) at each radius out from 
the centre of the cluster. A value of 1.16 is assumed for 
the ratio of baryons to electrons, in order to calculate the 
baryonic mass. The pressure is then integrated in shells from 
the cluster centre to the required radius. 



2.3 SZ and X-ray Observables 

The SZ signal is a measure of the integrated line-of-sight 
electron pressure, and when integrated over the whole clus- 
ter is a measure of the total thermal energy. The observed 
change in CMB temperature as a result of the SZ effect is 
given by 



f(x)y, 



(12) 



ATsz 

TCMB 

where a; is a dimensionless frequency equal to /wV^bTcmb 
and y is the comptonisation parameter, which is equal to the 
integrated optical depth multiplied by the fractional gain in 
energy per scattering. Hence y is proportional to the integral 
of n e T c along the line of sight 



y 



ksT c 



ord/, 



(13) 



The frequency dependency f(x) also has a dependence on 
the electron temperature T B , given by 

f(x,T.)=(x^±±-4)(l + A(x,T.)), (14) 

where the relativistic correction A(x,T e ) is derived from 
highe r-order corrections t o the Kompa neets equation 
ijChallinor fc Lasenbvlfl99i : lltoh et allll998h . 

In the general case of modelling a non-isothermal cluster 
the product of pressure and f(x,T e ) is integrated along the 
line-of-sight. Replacing n c and T c by the radial dependence 
of the electron pressure, the expression for the change in the 
CMB temperature is given by 



ATsz = Tci. 



f(x,T e )^Zdl, 



(15) 



which is then solved numerically. 

The integral of pressure over the volume of the cluster 
is a measure of the total thermal energy and is therefore 
a useful quantity in characterising the overall thermal SZ 
effect. The integral of y over the solid angle of the cluster is 
an observable quantity that is often used for this purpose, 



Y 



v(fi)dn, 



(16) 



where Y is calculated within a given projected radius. How- 
ever, for the purposes of this work, we choose to use an 
intrinsic measure of Y that is independent of the distance 
to the cluster, and hence will allow comparison of clusters at 



different redshifts. We therefore assume spherical symmetry 
and use the following definition 



YDi = 4tt 



-PcOT 2, 

-r dr, 



(17) 



where Da is the angular diameter distance to the cluster 
and Y D\ is calculated over the cluster volume, enclosed by 
a physical radius typically equal to the virial radius. 

The observed continuum X-ray surface brightness from 
galaxy clusters is caused primarily by bremsstrahlung radi- 
ation from scattered electrons in the hot ICM, with some 
emission also from high energy transition lines. The con- 
tinuum X-ray emission is proportional to the projected 
square of the electron density, multiplied by the emissivity 
along the line-of-sight. If all of the emission were completely 
dominated by bremsstrahlung radiation then the emissivity 
would go as the square root of the temperature. While in 
practise this is not the complete case, in general the emis- 
sivity is a weak function of the temperature. The observed 
X-ray surface brightness per unit energy AE is given by the 
following expression 



dSx 
AE 



47r(l + z) 



n e A(E' ,T c )dl, 



(18) 



where z is the cluster redshift and A(E',T B ) is the X-ray 
spectral emissivity as a function of temperature and emitted 
energy, E' — E(l + z). The surface brightness for a particular 
observer-frame energy band is then given by the equation 



Sx = 



4tt(1 + z) 4 



n e A(T c )dl, 



where 



a(t s ; 



J A(E',T e )dE. 



(19) 



(20) 



In this work the emissivity values for every radial posi- 
tion in the cluster volume are interpolated from a look- 
up table of values, as a function of temperature and 
redshift, based upon the Raymond-Smith plasma model 
(jRavmond fc Smithl 1 19771 ) for an observer-frame energy 
range 0.5-2 keV and a metallicity of Z = 0.3 Z@. 

Combined observations of the SZ and X-ray surface 
brightness constrain the pressure and electron density of the 
ICM and hence provide constraints on all of the model pa- 
rameters. The physical properties of the cluster, including 
the total mass and electron temperature, as a function of 
radius are constrained by the 6 parameters defined by this 
model: c D m, r DM , n c0 , T e0 , r COTC and a. 



3 MODEL FITTING 

3.1 Bayesian inference of the parameter values 

We fit parameterized models to the SZ and X-ray data 
and obtain probability distributions for the derived phys- 
ical quantities. The joint probability distribution of a set of 
model parameters (8), given the data (d) and the model 
(A4j), can be calculated from Bayes' Theorem, 



Pr(8\d,M) 



Pr(d|9,.Mj)Pr(6>|.M j 
Pt(d\Mi) 



(21) 
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The probability of the data given the set of model parame- 
ters Pr(d\9, A4j) (the likelihood, L) can be calculated based 
on assumptions of the distribution of the error in the data. 
When the data set is large and therefore quasi-continuous 
(such as the noise generated in radio instrumentation), one 
can approximate the likelihood by the form given for Gaus- 
sian multivariate data (see e.g. ISiviall200q ) 



L = Pr(d\0,Mi] 

1 



(d-m)'C-i(d-m) (22) 



V(2tt)^|C| 

where A^ is equal to the size of d, C is the covariance matrix 
of the data, |C| is the determinant of the covariance matrix 
and m is the vector of model data. Since we wish to perform 
joint fits to k independent data sets, such that d — y] d^, 
then the total likelihood is just given by the product of the 
individual likelihoods. 

The probability of the parameter values given the 
model, Pr(0|A / !j), is known as the prior distribution, and 
encodes the prior information on the parameter values be- 
fore fitting to the data. For example, the uncertainty in the 
position of the cluster centroid might be known from pre- 
vious observations and from the pointing accuracy of the 
instrument. One might then wish to apply a Gaussian prior 
to the model cluster position based on the level of uncer- 
tainty. In this work the prior distributions for the model 
parameters are uniform distributions unless indicated oth- 
erwise, and comfortably encompass reasonable values based 
on previous work. 

The normalisation of the posterior probability distribu- 
tion is given by the evidence, which is equal to the proba- 
bility of the data given the model. The evidence is a mea- 
sure of the suitability of the model for fitting to the data 
and provides a quantitative measure of selecting between 
competing models. The evidence for the data, assuming a 
particular model, is calculated by marginalising the prod- 
uct of the likelihood and prior distributions over the model 
parameters. This is given by 



Pr(d\Mi 



Pr(d|0,Mj)Pr(0|Mj)d0, 



(23) 



which follows from Equation l21l and the fact that the inte- 
grated posterior distribution is normalised to unity. When 
a model provides a good fit to the data, the likelihood peak 
will have a high value, and hence the model will have a high 
evidence value associated with it. However if the model is 
over-complex then there will be large regions of low likeli- 
hood within the prior volume, thus reducing the evidence 
value for this model, in agreement with Occam's razor. This 
provides a mechanism to choose between competing models 
Mi. 



3.2 The SZ Likelihood 

The focus of this work is interferometric SZ data, which are 
assumed to have Gaussian distributed errors, and so have 
a likelihood that is expressed in the form given by Equa- 
tion !22l The SZ data consist of complex visibilities from the 
output of the correlator, which encode the Fourier transform 
of the product of the sky brightness and the primary beam 
of the instrument. The visibilities only sample the Fourier 



transform at discrete coordinates in the Fourier plane, de- 
fined by the configuration of the instrument antennas during 
the observation. 

Errors in the SZ data originate from three main sources; 
the thermal noise, the intrinsic CMB anisotropy and radio 
point sources. The total covariance matrix is thus given by 
the sum of these individual components, 



■ + C C 



+ C S . 



(24) 



The thermal noise component originates from the instru- 
ment electronics, the atmosphere and the ground, and since 
the noises are not correlated the covariance matrix for this 
component is diagonal. Experimental error in the SZ signal 
also arises from the presence of intrinsic fluctuations in the 
CMB that obscure the cluster. Since these fluctuations are 
inherent in the sky brightness, they are correlated in the vis- 
ibilities, and so result in a non-diagonal covariance matrix. 
The uncertainty due to the CMB is largest on the shortest 
CBI2 baselines a nd dominates the error in the SZ signal at 
I < 1000 (see e.g. lUdomprasert et alj|20o3 ). 

The remaining principal source of experimental error is 
the presence of bright radio point sources, especially near to 
the centre of the field, where there will be less attenuation 
from the primary beam and more confusion with compo- 
nents of the SZ signal. This error can be treated either by 
identifying the point sources in the long-baseline data or 
by simultaneously observing with higher resolution instru- 
ments. If one is confident that source variability is not an 
issue, then the source fluxes can be subtracted directly from 
the data and any residual flux errors incorporated into the 
covariance matrix. Alternatively, contamination due to point 
sources can be treated during the model fitting stage by the 
inclusion of a component of the covariance matrix corre- 
sponding to a source model. Scaling this component by a 
large pre-factor is approximately equivalent to masking out 
the data at the po sition of the source . This method is dis - 
cus sed in detail by iBond et al.l (|200Ch , iMason et all (|2003t) 
and lSievers et al.l (|2009f ). In the work presented here we con- 
sider only simulated mock data and so the presence of strong 
radio point sources are safely ignored. However in future 
work we will provide a full description of the point source 
treatment when presenting actual CBI2 SZ data. 



3.3 The X-ray Likelihood 

The raw X-ray surface brightness data are in the form of dis- 
crete photon counts distributed over a photon counting de- 
tector, and hence will have Poisson distributed errors. How- 
ever, since we assume that the cluster model is spherically 
symmetrical, we can radially bin the X-ray surface bright- 
ness so that the number of pixels contributing to each bin is 
typically greater than 100 over the radial range of interest. 
Under this assumption the likelihood is then well approxi- 
mated by the Gaussian form given by Equation[22] The er- 
rors in each radial bin value are uncorrelated and so the 
associated covariance is simply given by a diagonal matrix 
of the variance in the data. 
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3.4 Hyper-parameters and combining the data 

sets 

Radio interferometric SZ and X-ray surface brightness data 
typically have sensitivity to very different angular scales and 
depend differently on the physical properties that are being 
constrained. Therefore constructing an analytical physically- 
based model that provides a satisfactory simultaneous fit to 
both data sets is difficult, especially if we are unsure about 
the accuracy of the error estimates. It is therefore good prac- 
tise to weight the likelihoods for these data sets using two 
hyper-parameters, which can then be treated as nuisance 
parameters and marginalised over. The normalised modified 
Gaussian likelihood for the data now has the following ex- 
pression 



L 



Pr(d|0,e,Mj 



exp | -ei '—^- y - '- ) , (25) 



(2tt) N \C 

where e is the hyper-parameter. Its assumed that very lit- 
tle is known about the true value of e, except that it has 
an expectation value of unity. In this case a suitab l e prio r 
for each hyper-parameter is given bv Irlobson et al.l (J2002), 
where 



Pr(e) =exp(-e), 



(26) 



and e has a value between and oo. Note that if the Jef- 
freys prior were used instead, where Pr(e) = 1/e, then it 
would not be possible to normalise the posterior distribu- 
tion or calculate the evidence. By calculating the ratio of 
the Bayesian evidence for the case where hyper-parameters 
are used, to where they are not, it can be inferred whether 
they should be included. If this ratio is much larger than 
unity then there is a significant relative inconsistency in the 
error estimates of the two data sets and their inclusion is 
warranted. 



3.5 Implementation 

Impleme ntation of th e model fitting is performed using 
bayesys i|Skilling|l2004r ). a powerful Bayesian optimiser that 
uses the Markov Chain Monte Carlo method. Estimates of 
the posterior probability distribution are obtained for the 
model parameters by using all of the proposal distribution 
engines available to the program, bayesys provides the mod- 
elling code with parameter samples in the form of a unit 
hyper-cube from which the SZ and X-ray surface bright- 
ness likelihoods are calculated. The program initialises with 
a burn-in phase that evolves the MCMC chains from sam- 
pling just the prior distribution to locating regions of signif- 
icant posterior probability. This is performed in a series of 
steps known as selective annealing (due to the analogous re- 
lationship with annealing in thermodynamics), during which 
samples of parameters are drawn from a modified posterior 
given by 



Pr(0\d,Mi) ex Pr{d\9,MirPr(6\Mj 



(27) 



where the cooling factor (A) is slowly increased from to 
1. The rate of cooling (equal to the change in A over one 
step) should be much lower than unity in order to ensure 
that bayesys is robust to finding regions of high posterior 



value in large prior volumes. Therefore the rate of cooling 
is set to a value of 0.1 and multiple runs are performed 
with different seed values to check for good convergence. The 
robustness of the process is improved by using an ensemble 
of 10 simultaneous MCMC chains that communicate after 
each step and so ensure that they do not become trapped in 
local maxima of likelihood. 

After burn-in, the program draws samples from the pos- 
terior distribution at A = 1, where the total number of sam- 
ples drawn is equal to the product of the number of MCMC 
chains and steps. The number density of the model samples 
should then be a good estimate of the true posterior prob- 
ability distribution. The resulting output includes a list of 
the samples which can be marginalised over to obtain pos- 
terior distributions for the cluster properties, as well as an 
estimate of the evidence. 



4 TESTING THE MODEL 

We test the ability of the entropy-based model described in 
Section l2~2l to reproduce the known properties of galaxy clus- 
ters by fitting to simulated SZ and X-ray surface brightness 
data. We simulate SZ data from the CBI2 31 GHz interfer- 
ometer and typical X-ray surface brightness data from the 
EPIC camera on XMM-Newton. 



4.1 Mock CBI2 SZ data 

The sky signal for the simulated CBI2 observation is gen- 
erated from summing the SZ and intrinsic CMB compo- 
nents. The SZ thermodynamic temperature for a simulated 
galaxy cluster is calculated from the electron pressure us- 
ing Equation ll5l The CMB temperature map is constructed 
fr om an angular power spect rum generated using cmbfast 
(|Seliak fc Za_darriaga] ll996). assuming a flat ACDM cos- 
mology with n M = 0.3, n A = 0.7 and h = 0.7. The SZ 
and CMB thermodynamic temperature maps are converted 
to sky brightness units for each of the 10 frequency chan- 
nels (26-36 GHz) using the differential form of the Planck 
equation 

AI V 2k,BV 2 x, x ,x-2 /■„_,■> 

d^ = ~^ x e (e - 1) , (28) 

where T is the thermodynamic temperature and x — 
hv/kWTcus- The two brightness maps are then summed to 
form a combined sky simulation. 

The sky brightness map is multiplied by the CBI2 pri- 
mary beam and fast Fourier transformed to the (u,v) plane, 
where the visibility is then interpolated onto the positions 
given by a template observation. The noise weightings as- 
sociated with each visibility are then scaled based on the 
length of the simulated integration. The effect of the ground 
spill-over subtraction technique used in real CBI2 observa- 
tions (see e.g. [Udomprasert et al. 2004) is simulated by tak- 
ing the average of the visibilities from two maps with sepa- 
rate realisations of the CMB, and subtracting this from the 
mock data set. This will increase the noise in the data by a 
factor of -i/(3/2). Once this has been applied, the data are 
then in the same form as a real CBI2 data set. 



1 http://lambda.gsfc.nasa.gov/toolbox/tb_cmbfast_ov.cfm 
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Figure 1. The estimated prior probability density for the entropy-based model parameters. The constraints arise from the underlying 
assumptions about the cluster physics. The prior ranges encompass typical values of these parameters observed to date. The grey scale 
represents estimates of the 68 and 95 per cent intervals. 



4.2 



Mock X-ray surface brightness data 



The X-ray surface brightness is calculated from the known 
electron density and temperature distribution of a simulated 
cluster by using EquationQI|] The resulting surface bright- 
ness map is then multiplied by the count rate conversion 
factor of a typical instrument (the EPIC camera on XMM- 
Newton), which is assumed to be constant across the 0.5- 
2keV energy band. We use a typical neutral hydrogen col- 
umn density of 10 20 cm -2 , and an integration time equal to 
10 ks. The map is then multiplied by the solid angle per pixel 
(AJ7) to convert the data to the expected counts per pixel. 
The number of counts on each pixel is then related to the 
surface brightness and the pixel solid angle by the following 
expression 



JVcnts — 



S'x 



ergs~ 



( ^ 5 x 10 15 cnts. (29) 



V sr / 



Noise is introduced into the map by applying the Poisson 
distribution to each pixel, with the mean equal to the photon 
count. For the purposes of the mock observation we do not 
take into account the point spread function, the effect of 
which is assumed to be insignificant on the cluster scales that 
are being modelled. We also do not the take into account the 
residual background X-ray emission which would need to be 
considered for a real observation. 

The X-ray surface brightness profile data are then con- 
structed by binning the photon counts map into 5arcsec 
radial bins and converting from counts to physical units. 
The error in the mean value associated with each bin is just 
the square root of the bin value divided by the number of 
pixels that contributed to that radial bin. Hence at moder- 
ately large radii the number of pixels contributing to each 



bin is large and so the statistics can be approximated by a 
Gaussian distribution. 



5 RESULTS 

5.1 The Prior 

It is important to first understand the intrinsic constraints 
introduced by the prior before performing a joint fit to the 
data. The prior includes both the volume of the parameter 
space being considered and the assumptions upon which the 
model is based. Figure[l] shows the estimated prior distribu- 
tion for each of the parameters in the entropy-based model. 
This plot was constructed by running the analysis pipeline 
with all of the model constraints in place, but with no SZ 
or X-ray surface brightness data. The MCMC chains are al- 
lowed to search parameter space within the constraints of 
the model until a large number of samples are generated 
(~ 100, 000) and thus produce a reasonable estimation of 
the prior distribution. 

Constraints on the parameters arise from the assump- 
tion that the intra-cluster medium is in hydrostatic equilib- 
rium and that the pressure profile must be physical at all 
radii. The model also prefers higher central electron temper- 
ature values due to the constraint on the pressure. In order 
for the condition of hydrostatic equilibrium to be satisfied, 
large mass values can only correlate to relatively high tem- 
peratures, and this will provide a constraint on the value of 
tdm which is seen in the figure. The nature of interferometric 
data is such that they are not able to constrain a signal that 
is non-varying over the angular scales to which the instru- 
ment is sensitive. Therefore in the case of low signal-to-noise 
data it would be possible to obtain solutions within the ac- 
ceptable prior range of the model, where the temperature 
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increases rapidly at large radii and while the density pro- 
file decreases, leading to an SZ signal with a large constant 
additive component. To avoid this clearly unphysical situa- 
tion we impose the constraint that the electron temperature 
must either be constant or decreasing at radii larger than 
the virial radius. The principal effect of this constraint is to 
significantly reduce the probability of having an nonphysical 
high value for the entropy power law parameter a at large 
radii. 



5.2 Self-Consistency 

We construct mock CBI2 SZ and X-ray surface brightness 
data from the entropy-based model in order to test that, 
given idealised high signal-to-noise, the correct values for 
cluster properties can be returned from model fitting to the 
data. Two spherically-symmetric model clusters are con- 
structed for the self-consistency test, one with a cool-core 
and the other with a non-cool core. For this test the data 
have high signal-to-noise and the intrinsic CMB features are 
not included in the SZ signal. The estimated posterior prob- 
ability density for the temperature profile of each model 
cluster is shown in Figure[2] The results show that correct 
unbiased estimates of the cluster properties are reproduced 
by fitting the entropy-based model to idealised data con- 
structed from the same model. 



5.3 Hydrodynamic/N-body Simulations 

Cosmological JV-body/hydrod ynamic simulations of two 
clusters from lKav et al.l l|2008l ) are used to test the entropy- 
based mod el. The simulations w ere run the with the Gad- 
Gbt2 code (|Springel et alj2005T ) that uses the Particle- Mesh 
and tree algorithms to calculate gravitational forces and 
Smoothed Particle Hydrodynamics to model the gas. Ad- 
ditionally, the gas was allowed to cool radiatively, leading to 
a decrease in entropy, and form collisionless stars at low tem- 
perature (T < 10 5 K) and high density (n H > 10~ 3 cm~ 3 ). 
Thermal energy was also injected into such regions (feed- 
back) leading to a large increase in the entropy of the lo- 
cal gas. This phenomenological approach to re-h eating and 
galaxy formation in the cluster was found by iKav et al.l 
(2004) to reproduce the observed Lx — Tx relation, and so 
provides a good test for the entropy-based model. The sub- 
sample used in this work consists of two clusters with similar 
total masses (~ 10 Mq) and temperatures (~ 5keV), but 
with quite different merger histories. FBI has a cool core and 
is in a relaxed state, while FB3 is a non-cool core cluster and 
recently underwent a strong merger at z ~ 0.4. Therefore 
our model should not only be able to reproduce the global 
properties of each cluster, but also correctly constrain the 
different profiles at large radii. 

Maps of AT SZ (at 31 GHz) and Sx (in the 0.5 - 2.0 keV 
band) were generated for each cluster by calculating the elec- 
tron density and temperature properties for every hot par- 
ticle within a cylinder of length 6 r2oo and projected radius 
3r2oo- Each particle contribution is smoothed and projected 
along the length of the cylinder onto a 1024 x 1024 2D 
pixel array, using the projected version of the gadget2 SPH 
kernel. The SZ decrement and X-ray surface brightnesses 
were calculated using EquationsfTS] and 1191 The CBI2 SZ 



and X-ray surface brightness maps for each simulated clus- 
ter are shown in Figure[3] with realistic noise applied. Note 
that, for a fair comparison, the same A(T C ) cooling func- 
tion is used to generate the simulated X-ray surface bright- 
ness map that is also used when producing the aforemen- 
tioned analytical model. The maps are centred about the 
most gravitationally-bound dark matter particle and thus 
the most globally symmetric point. For the relaxed cluster 
FBI, this coincides with the brightest SZ and X-ray surface 
brightness points. In the case of FB3, this cluster underwent 
a recent merger event that has produced an asymmetric core 
region, and therefore the brightest pixels in the maps are 
offset from the centre. However the symmetry of the global 
structure of this cluster is centred about the central pixel of 
the map and not the fine substructure of the brightest peaks. 
This therefore acts to flatten the central part of the electron 
density profile, and hence also the X-ray surface brightness. 
The profiles for the physical properties of the simulated 
clusters are shown in Figure^ Clusters of galaxies typically 
depart from hydrostatic equilibrium within their core region 
due to physical processes other than gravitational collapse, 
such as disturbance from recent merger activity, pre-heating 
from active galactic nuclei and radiative cooling. Both sim- 
ulated clusters exhibit this behaviour within a radius of 
< 100 kpc, and their gas properties diverge considerably in 
this region. We therefore choose to ignore the central clus- 
ter core in fitting to the data since the high gas density, and 
hence X-ray surface brightness, will strongly bias the results. 

5.4 Results from simulated data 

Mock SZ data and X-ray surface brightness data are con- 
structed from the simulated cluster maps using the method 
described in sections 14.11 and 14.21 We fit the entropy-based 
model to both high and low signal-to-noise simulated data 
in order to investigate possible systematics introduced into 
the derived cluster properties. For the high signal-to-noise 
case we fit to idealised data with very low experimental er- 
ror, no calibration errors, and no intrinsic CMB signal in 
the SZ data. Conversely for low signal-to-noise data we sim- 
ulate larger experimental error, and introduce calibration 
error and an intrinsic CMB component in the SZ data. The 
calibration errors are introduced as nuisance parameters to 
be marginalised over and are distributed by a Gaussian prior 
with cr ca i given by the quoted error value. A calibration error 
of 5 per cent is used for the SZ data, typical of the calibra- 
tion of CBI2 visibility data, and the X-ray surface bright- 
ness data typically contain a calibration error of 10 per cent 
JAndersson fc Madeiskill2004h . 

Figure[S] shows the constraints introduced from sepa- 
rately fitting to high signal-to-noise SZ and X-ray data. The 
SZ data constrain the integrated line-of-sight pressure and 
therefore generate an anti-correlation between the central 
electron density and temperature parameters. In addition 
the SZ data reduce the likelihood of having large central 
electron density and temperature values since these would 
generate large SZ signals that are inconsistent with the data. 
The X-ray surface brightness data are proportional to the 
integrated square of the line-of-sight electron density and 
therefore strongly constrain the central electron density pa- 
rameter. The X-ray data is of relatively higher resolution 
than the SZ data and is therefore much more dependent 
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Figure 2. The estimated posterior probability density for the temperature as a function of radius, from joint fitting the entropy-based 
model to mock high signal-to-noise CBI2 SZ and X-ray surface brightness data, for cool core {left) and non-cool core {right) clusters 
constructed from the entropy-based model. The grey scale represents the 68, 95 and 99 per cent intervals. The dashed lines represent the 
true simulation values. 
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Figure 3. Mock observations of the X-r ay surface bright ness {grey-scale) and SZ temperature decrement {contours) , for the simulated 
clusters FBI {left) and FB3 {right) from lKav et al.l 1120081) . The grey-scale is in units of counts per pixel for a generic X-ray observation. 
The contours represent multiples of the 2 a noise level from a typical 31 GHz observation using the CBI2 array. The full-width half- 
maximum of the CBI2 synthesised beam is indicated in the bottom left-hand corner of each map. 



upon the shape of the entropy profile providing strong con- 
straints on r corc and a. The high signal-to-noise surface 
brightness data do not provide a strong constraint on the 
temperature and mass of the cluster. In the case of FBI the 
X-ray surface brightness data appear to generate a series of 
high likelihood peaks within parameter space. However the 
data are unable to distinguish between these in the absence 
of an additional constraint from the SZ data. 

Results from fitting simultaneously to both data sets, 
for both the high and low signal-to-noise cases, are shown in 
Table[T]and Figures[U]and[7] where estimates of the posterior 
distribution for the model parameters, profiles of the cluster 
properties and selected enclosed quantities are plotted. 

The model parameters are constrained by the combi- 
nation of both data sets. For the purposes of comparison 
with the known simulated cluster properties the radial pro- 
files are scaled by the known true r2oo value, and all global 
quantities are calculated by integrating within this radius. In 



the high signal-to-noise regime the radial profiles and global 
parameters of the true physical properties are reproduced 
well by the parametric fit, with systematic erro r < 10 per 
cent a t r2oo- This result is fairly consistent with IKav et al.l 
i|2004f ). who find that the hydrostatic mass estimated from a 
combination of /3 model fits to the X-ray surface brightness 
data and spatial temperature information agrees to within 
10 per cent (r < rsoo) for the simulated clusters which con- 
tain feedback. They find that the presence of feedback pro- 
duces a higher degree of thermalisation than if the cluster 
were simply described by a non-radiative model and so the 
estimated mass is close to the true value. In the low signal- 
to-noise regime the increase in noise leads to much larger 
widths in the probability distributions of the cluster quan- 
tities and, while shifting the peaks of the distributions, are 
still consistent with the known simulation values within the 



Tabled gives the natural logarithm of the evidence for 
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Figure 4. The physical properties of the simulated clusters FBI {dashed line) and FB3 (solid line) as a function of radius. The dot- 
dashed line in the frame showing the radial entropy represents the power law predicted by simulations. The dot-dashed line in the frame 
showing the ratio of the pressure gradient to gravity represents a cluster in hydrostatic equilibrium. FBI is cool-core relaxed cluster and 
FB3 is non cool-core. 
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Table 1. Results from joint entropy-based model fits to the sim- 
ulated clusters. Column 1 gives the cluster name; column 2 the 
data description; column 3 the integrated comptonisation within 
the virial radius, in units of 10 -4 Mpc 2 ; column 4 the total mass 
within the virial radius, in units of 1O 14 M0, column 5 the gas 
mass fraction within the virial radius. Errors represent the 68 per 
cent interval. 



different models given the data. The evidence values for both 
low and high signal-to-noise regimes support the inclusion 
of hyper-parameters. The ratios of best fit values for the 
SZ and X-ray hyper-parameters in each regime are 15:1 and 
3.5:1 (high signal-to-noise), and 2.5:1 and 1.1:1 (low signal- 
to- noise), for FBI and FB3 respectively, indicating that the 
hyper-parameters weight the likelihood in favour of the SZ 
data. While the CBI2 SZ data are insensitive to variation 
on scales smaller than 500 kpc, the X-ray data are much 
more sensitive to variations over this range. The nature of 
the parametric model is such that it is inherently smooth on 
these smaller scales; hence clumping of the gas will deviate 
the model from the X-ray data, while still producing a rel- 
atively good fit to the SZ data. Therefore the weights will 
typically favour the SZ data relative to the X-ray surface 
brightness. In the case of the FB3 data, the relative weight- 



ings are more similar since the X-ray data are a closer match 
to the model over the considered radius range. A reduction 
in signal-to-noise leads to a decrease in the ratios of hyper- 
parameters for each cluster, and so both the SZ and X-ray 
data sets provide a more consistent match to the model. 



5.5 Comparison with the isothermal /3 model 

Even though the simulated clusters are clearly not isother- 
mal, for the purposes of comparison to previous work and 
to demonstrate the systematic differences in derived clus- 
ter properties using the two different models, we perform 
joint fits using the single isothermal /3 model. The electron 
density and temperature are then given by 



n e0 1 + 



\ ' core / 



-3/3/2 



T c — T c t 



(30) 
(31) 



where n c o and T e o are the central electron density and tem- 
perature respectively, r corc is the core radius, and /? is a 
parameter that determines the large scale behaviour of the 
electron density. 

Figures [8] and [9] show the resulting posterior distribu- 
tions for the model parameters, the profiles, and global val- 
ues within r2oo- There is a strong systematic difference be- 
tween the estimated and true simulation values when us- 
ing this model, resulting from over-estimates of the elec- 
tron temperature and SZ decrement. The result is a sys- 
tematic over-estimate of the total mass of ~ 20 per cent at 
r2oo and an under-estimate of the total mass at r_<_rsoo. 
These results are consistent with the findings bv lKav et al.l 
(2004) who measure similar errors in estimating the total 
mass based upon the isothermal model. The systematic er- 
ror in derived cluster properties is seen in both the high and 
low signal-to-noise scenarios, and therefore the introduction 
of both thermal noise and intrinsic CMB anisotropy is not 
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Figure 5. The estimated posterior probability density for the entropy-based model parameters from separately fitting to simulated high 
signal-to-noise CBI2 SZ data (top triangle) and X-ray surface brightness data (bottom triangle). The top and bottom plots represent fits 
for cluster FBI and FB3 respectively. The grey-scale represents the 68 and 95 per cent intervals. 
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Figure 6. The estimated posterior probability density for the entropy-based model parameters from joint fitting to simulated high 
signal-to-noise (top triangle) and low signal-to-noise (bottom triangle) CBI2 SZ and X-ray data. The top and bottom plots represent fits 
for clusters FBI and FB3 respectively. The grey-scale represents the 68 and 95 per cent intervals. 
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Figure 7. The estimated posterior probability density for properties of Cluster FBI from joint fitting the entropy-based model to 
simulated high signal-to-noise (top) and low signal-to-noise (bottom) CBI2 SZ and X-ray data for cluster FBI. The grey scale represents 
the 68, 95 and 99 per cent intervals. The dashed lines represent the true simulation values and the error-bars the mock data. 
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Figure 7 - continued Cluster FB3. 
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Figure 9. The estimated posterior probability density for properties of Cluster FBI from joint fitting the isothermal f3 model to simulated 
high signal-to-noise (top) and low-signal-to-noise (bottom) CBI2 SZ and X-ray data for cluster FBI. The grey scale represents the 68, 95 
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Figure 9 - continued Cluster FB3. 
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High SNR 


Low SNR 


Name 


Model 


-Mo 


Mi 


Mo Mi 


FBI 


Entropy 


6113 


6476 


4757 4764 




Isothermal 


5946 


6405 


4559 4724 


FB3 


Entropy 


6147 


6273 


4807 4816 




Isothermal 


6131 


6231 


4810 4817 



Table 2. The natural logarithm of the evidence (the probabil- 
ity of the data given a particular parametric model, see Equa- 
tion l23t for high and low signal-to-noise CBI2 SZ and X-ray sur- 
face brightness data. Note that the absolute value of the evidence 
is dependent upon the data, and so the quantity of interest is 
the difference between these values. Models that include SZ and 
X-ray hyper-parameters are represented by Mi and those that 
do not are represented by Mo- 



enough to dominate over the effects of the intrinsic isother- 
mal /3 model discrepancy with the simulations. 

It is informative to compare the relative quality of the fit 
that the entropy-based and single isothermal /3 models give, 
based upon their respective logarithmic evidence values. In 
almost all the cases the evidence for the isothermal /3 model 
is significantly lower than that of the entropy-based model, 
with the exception of the low signal-to-noise scenario for the 
FB3 simulated cluster. This is due to the flat core nature 
of the X-ray surface brightness profile for this cluster, as 
a result of the displacement of the gas peak in the central 
region. As such when significantly lower signal-to-noise data 
is used the isothermal j3 model, with its flat core behaviour 
at small radii, provides an equally good fit (if not slightly 
better) than the entropy-based model. However in the case 
of the FBI simulated cluster, the X-ray profile is strongly 
peaked in the central region and therefore the isothermal fi 
model provides a significantly poorer fit to both high and low 
signal-to-noise data. This can be seen upon visual inspection 
of the electron density profiles for FBI in Figure(9] where 
the isothermal /3 model fails to provide a suitable fit at both 
small and large radii. 

The evidence values in Table[2] support the inclusion of 
hyper-parameters in both the high and low signal-to-noise 
scenarios for the isothermal /3 model. The relative weightings 
of each the SZ and X-ray data sets are to 30:1 (FBI) and 3:1 
(FB3) for the high signal-to- noise data, and 4:1 and 1.1:1 for 
the corresponding low signal-to-noise data. The likelihood 
calculation is therefore weighted in favour of the SZ data, 
except in the case of the low signal-to- noise FB3 data where 
both are almost equally favoured. 



6 DISCUSSION AND CONCLUSIONS 

We have developed a parametric model for the gas in galaxy 
clusters, based on three physical assumptions: the dark mat- 
ter follows an NFW profile, the gas entropy can be described 
by a power law with a flattened core, and the gas is in hydro- 
static equilibrium. Using entropy rather than, for example, 
the gas density as the basic parametrization is motivated by 
the theoretical and observed self-similarity of entropy pro- 
files in cluster samples. This model can be constrained by SZ 
and X-ray data to give fitted gas and total matter properties 
of the cluster. The model has sensible convergence proper- 



ties and can be used out to the virial radius of the cluster. 
By construction, the model does not allow unphysical or 
inconsistent properties of the cluster gas as can happen if, 
for example, a parametric fit to the gas density is combined 
with an unrelated parametric fit to the temperature. 

We have tested the model using two detailed N-body 
plus hydrodynamical simulations of massive clusters with 
contrasting merger histories. In both cases, using realistic 
mock data from presently available X-ray and SZ telescopes, 
the model is able to accurately fit both the integrated clus- 
ter parameters and their radial profiles. If high-quality data 
with very low noise are simulated, the cluster parameters are 
returned with essentially no bias. Our fitting code includes 
both random noise and systematic calibration errors in the 
data and fully includes the effect of contamination from pri- 
mordial CMB fluctuations and radio sources in the SZ data. 
We also use a hyper-parameter approach to scale the rela- 
tive constraints from the two data sets. Comparison with the 
widely-used isothermal ft model confirms previous results 
that this model can result in significant biases in fitted clus - 
ter parameters (e.g. iKav et al.ll20ol : lHallman et alJl2007f ). 
The quality of the available SZ data is now high enough to 
require a more sophisticated modelling approach, especially 
with data that are sensitive to the outskirts of cluster. 

This model however remains simplistic in several poten- 
tially important ways. The assumption of hydrostatic equi- 
librium is clearly broken badly in the central cores of clus- 
ters, and we are forced to ignore the data in this region. Hy- 
drostatic equilibrium will also be broken in the main body of 
the cluster due to bulk motions and other non-thermal sup- 
port, although in our simulations this does not seem to be 
a significant impediment to measuring accurate cluster pro- 
files. We do not currently treat the boundary of the cluster 
in a fully consistent way - at some radius the virialised gas 
must meet a boundary shock of in-falling material and we 
do not model the corresponding step in pressure. We also do 
not yet consider additional observation constraints such as 
X-ray spectral and optical weak lensing measurements, al- 
though these are in principle straightforward to incorporate 
in to our analysis framework. 

In subsequent papers we will use this model to analyse 
SZ data from the CBI2 experiment jointly with relevant X- 
ray imaging data. 
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